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Abstract: In this article we study domino tilings of a family of finite regions 
called Aztec diamonds. Every such tiling determines a partition of the Aztec 
diamond into five sub-regions; in the four outer sub-regions, every tile lines up 
with nearby tiles, while in the fifth, central sub-region, differently-oriented 
tiles co-exist side by side. We show that when n is sufficiently large, the 
shape of the central sub-region becomes arbitrarily close to a perfect circle of 
radius n/ \/2 for all but a negligible proportion of the tilings. Our proof uses 
techniques from the theory of interacting particle systems. In particular, we 
prove and make use of a classification of the stationary behaviors of a totally 
asymmetric one- dimensional exclusion process in discrete time. 
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1. Introduction. 



Figure 1 shows a domino tiling of the Aztec diamond of order 64. In 
general, the Aztec diamond of order n is a region composed of 2n(n + 1) 
unit squares, arranged as a stack of 2n centered rows of squares, with the kth 
row having length min(2/c, 4n — 2/c + 2), and a domino is the union of any two 
of these unit squares that share an edge. It can be seen that the tiling shown 
in Figure 1 consists of a roughly circular region in which tile-orientations are 
mixed, surrounded by four regions in which the tiling exhibits a repetitive 
"brick- wall" pattern. In this article we will demonstrate that the large-scale 
structure seen in the figure is no coincidence but is in fact increasingly certain 
to occur as one looks at random tilings of ever-larger Aztec diamonds. This 
behavior is in sharp contrast with the behavior of random domino tilings of 
2n-by-2n squares, which are statistically homogeneous unless one looks quite 
close to the boundary [2] . 

As a first step towards a precise statement of the main result, it is impor- 
tant to note that the four brick- wall patterns are genuinely different from one 
another. One might not at first sec the difference between the brick-pattern 
seen at the top of the diamond and the brick-pattern seen at the bottom, but 
in fact they are out of phase with one another. To see this, imagine coloring 
the squares alternately black and white. Then, scanning from left to right, 
rows in the upper half of the diamond begin with a square of one color while 
rows in the lower half begin with a square of the other color. Thus the two 
horizontal brick- wall patterns extend to distinct tilings of the entire plane; in 
one, every domino has its left square colored black, and in the other, every 
domino has its left square colored white. A similar situations prevails for the 
vertical tiles. 

The recognition of the existence of four rather than two distinct sorts 
of tiles plays a key role in the formulation of the "shuffling algorithm" (de- 
scribed in section 2). Shuffling was introduced in [4] to prove that the Aztec 
diamond of order n has exactly 2"("+^)/^ domino tilings; here we use shuffling 
to generate tilings uniformly at random (Figure 1 was generated in precisely 
this way). 

To lay the groundwork for a description of shuffling, consider the set 
of vertices in a paving of the Aztec diamond of order n by unit squares. 
Figure 2 shows a domino tihng of the Aztec diamond of order 8. As shown 
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in the figure, we mark with a dot the middle vertex along the upper (or 
"northern") border of the Aztec diamond, and we also mark with a dot 
every vertex that can be reached from this vertex by a lattice-path of even 
length. Every domino then has a dot at the midpoint of exactly one of its 
four sides, and we will assign the domino a heading (north, south, east, or 
west) according to which of its sides (north, south, east, or west) is dotted. 
Thus a horizontal domino is either north-going or south-going and a vertical 
domino is either east-going or west-going; the reason for this terminology will 
become clear in section 2. Headings of dominoes are indicated by arrows in 
the figure. 

Some of the dominoes in Figure 2 have been shaded, namely, those that 
percolate to the boundary by way of dominoes with the same heading. For 
instance, a north-going domino has been shaded if and only if it is possible 
to get from that domino to a north-going domino that shares an edge with 
the boundary of the Aztec diamond by means of a (possibly trivial) sequence 
of north-going dominoes, each of which shares an edge with the one before. 

Observe that the only way a north-going domino can share an edge with 
the boundary is if abuts one of the horizontal edges in the upper half of the 
diamond, and that the presence of such a domino leads to a cascade of north- 
going dominoes reaching all the way up to the top row. It follows easily from 
this that the shaded north-going dominoes form a single connected block, 
which we call the arctic region. (Similar remarks apply to the other four 
"frozen" regions.) The unshaded region is called the temperate zone. 

Define the inscribed circle as the circle of radius n/-\/2 centered on the 
middle of the Aztec diamond. We can now state the main result of this 
paper: 

Theorem 1 (the Arctic Circle Theorem): Fix e > 0. Then for all sufficiently 
large n, all but an e fraction of the domino tilings of the Aztec diamond of 
order n will have a temperate zone whose boundary stays uniformly within 
distance en of the inscribed circle. 

Note that this implies, for all e > 0, that when n is sufficiently large, all 
but an e fraction of the domino tilings of the Aztec diamond of order n will 
have a temperate zone whose symmetric difference with the interior of the 
inscribed circle has less than en . 

To prove the Arctic Circle Theorem, we will use facts about the totally 
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asymmetric exclusion process on Z taking place in discrete time. This is a 
stochastic process whose states are doubly-infinite sequences {...,X-i,xo,xi, 
...) of I's and O's, whose ith term signifies the presence or absence of a particle 
at location i in a 1-dimensional lattice; at each discrete time-step, a particle 
that has a vacancy to its right has a 50% chance of moving one step to the 
right and a 50% chance of staying put. (Note that a particle with another 
particle immediately to its right has no chance of moving until the location 
to its right becomes vacant.) That is to say: If the system at time n is in 
a particular state (..., a;o, ...) and you want to advance the system 
to time n + 1, then find all i such that Xi = 1 and Xj+i = (these i's 
are necessarily non-consecutive), and for each such i, toss a fair coin (with 
independent coins for different values of i); when it comes up heads, put 
x'^ — and x'^^^ — 1, and when it comes up tails leave x'^^ — 1 and x'^^^i — 0. 
Put x[ — Xi for all other i. In this way one obtains a new doubly-infinite 
sequence {...,x'_i,x'q,x[, ...). One repeats this process x \^ x' for infinitely 
many iterations. (Here the "?" is to remind us that x' is a stochastic function 
of X.) 

Now consider the initial condition x* with 



1 ifi<0, 
if i > 0. 



We are interested in how the exclusion process evolves when it is in state x* 
at time 0. 

It will be shown in section 3 that the Arctic Circle Theorem can be 
reduced to the following assertion: 

Theorem 2: Fix a < f3. If one runs the exclusion process starting from 
the state x*. then the number of particles in the interval [an,f3n] at time n, 
normalized by dividing by n, converges in probability to h{a) — h{P), where 



h{u) 



—u for -u < — |, 

^-5n/F^ fo^^ < M < |, and 
for M > |. 



Theorem 2 is quite analogous to Rost's Theorem [7] on the behavior of the 
totally asymmetric exclusion process in continuous time. The quantitative 
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form of the result is different (Rost obtains a parabolic arc where we obtain 
a circular arc), but most of the methods are the same. 

Note that the assertion for | = a < /3 follows immediately from the fact 
that the position of the rightmost particle at time n is binomially distributed 
with mean n/2 and variance n/4. The case of a < /9 = — | is similar (one 
looks at the leftmost vacancy instead of the rightmost particle). Hence all of 
the interest of Theorem 2 lies in the case — |<Q;</3<|. 

To prove Theorem 2, we will need to estabhsh some general facts about the 
exclusion process. Let X denote the set of sequences x = (..., xq, Xi, ...) 
with terms in {0, 1}, and let M.{X) denote the set of probability measures 
on A', relative to the usual Borel ci-algebra on X generated by the cylinder 
sets {x e X : Xi^ = bi,...,Xii^ = bk} {ii,...,ik G Z, bi,...,bk G {0,1}). The 
law governing the exclusion process, run for one time-step, gives a stochastic 
function from X to X, sending x to some x'. If /i G Ai{X) has the property 
that Prob^(2:)(x' G A) = Prob^(2;.)(x E A) = fJ^{A) for all Borel sets A G X, 
then we say fi is stationary under the dynamics. Equivalently, if we define 
the time-evolution map F : /A{X) — > A4{X) by the formula Probi?(^)(a; G 
^4) — Prob^(a;' G A), then is stationary if and only if it is a fixed point of 
F. 

The shift-operator k k + 1 on Z gives rise to a shift-map T on X, with 
(Tx)i = Xi+i for all i; this in turn gives rise to a shift-operation T* on Ai{X), 
with (T*ii){A) = n{T^^A) = Prob^(a;) (Ta; E A). If a measure ii satisfies 
T*iJ, = ji, we say that ji is translation- invariant (or shift-invairiant) . 

If for all Borel sets S and all reals a with < a < 1 we define 

(a/xi + (1 - a)iJ,2){S) = aiJ,i{S) + (1 - a)iJ,2{S) 

and if wc decree that /i„ — z/ if and only if /i„(A) '^{A) for all cylinder 
sets A (this is often called the weak topology), and if we make use of the 
fact that a probability measure on Ai{X) is determined by the measure it 
assigns to cylinder sets (see for instance Theorem 3.1 in [1]), then we can give 
A4{X) an affine structure and a topology. Under these definitions, the set of 
stationary, translation- invariant probability measures in M.{X) is a compact 
convex set. Thus, every element of it can be written as a convex combination 
of extremal elements. 

Given < d < 1, put s = ^d"^ + (1 - d)^, and let 
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with values at the endpoints d = 0, d = 1 being given by continuity. Let /jL^ 
be the unique translation-invariant probability measure on X such that 

IJ.d{{x e X : xo^bo, xi^ bi, ...,Xk = bk}) 

is equal to 

p{bo)q{bo, bi)q{bi, 62) ■ ■ • g(&fc-i, bk) 

for all /c > 1 and all bQ,...,bk in {0,1}. Note that this measure has the 
Markov property: any event that is measurable with respect to .., X_2, X-i 
is conditionally independent of any event that is measurable with respect to 
Xi,X2, ... given Xq. 



Theorem 3: The translation- invariant, stationary probability measures on 
X are precisely those that can be expressed as convex combinations of the 
measures /^d, < d < 1. 

Theorem 3 will be of independent interest to those who study particle 
system models. The proof is similar to the proof that was devised by Rost 
for use in the continuous-time setting; the only non-straightforward trick 
that the discrete-time version of the proof requires is the coupling argument 
presented in section 4. 

The rest of this article is organized as follows. In section 2, we describe a 
combinatorial operation on tilings, called shuffling, that plays a fundamental 
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role in the proof. In section 3, we show that the Arctic Circle Theorem 
(Theorem 1) reduces to our assertion (Theorem 2) about the behavior of the 
discrete-time exclusion process when the initial state is x* . In section 4, we 
classify the stationary, translation-invariant measures in M.{X) (Theorem 3). 
In sections 5 and 6 we show that Theorem 3 implies Theorem 2, which in turn 
causes Theorem 1 to topple into place. In section 7, we modify the notion of 
shuffling to allow for the study of random tilings in which one sort of domino 
(horizontal or vertical) is favored over the other, and we explore the extent 
to which the methods used in the unbiased case still allow one to determine 
the asymptotic shape of the temperate zone. Finally, in section 8, we offer 
some comments that may help the reader to understand why the existence 
of spontaneous large-scale structure in random tilings is not as surprising as 
it might at first seem. 

2. Shuffling. 

For the convenience of the reader, we restate (without proof) the details 
of the shuffling algorithm introduced in [4]. 

Domino shuffling is a stochastic procedure that turns a domino tiling of 
the Aztec diamond of order n — 1 into one of several domino tilings of the 
Aztec diamond of order n. If one starts from the (empty) tiling of the Aztec 
diamond of order and applies shuffling n times, the result is a uniformly 
random domino tiling of the Aztec diamond of order n — that is, each of the 
2n(n+i)/2 filings has probability 2""^'*+^)/^ of being generated. 

Consider an Aztec diamond of order n — 1 tiled by dominoes, where 
each domino has been assigned a heading (north, south, east, or west) as 
described in section 1. When two dominoes share a side of length 2, they 
must be heading in opposite directions. If the arrows point away from each 
other, the two dominoes form a good block; if the arrows point towards 
each other, they form a bad block. 

The shuffling procedure has three steps: destruction, shding, and creation. 
Only the last of the three steps involves randomness. In the destruction 
step, all the bad blocks in the tiling of order n — 1 arc removed. In the 
sliding step, each domino simultaneously moves one step in the direction 
of its arrow. After the sliding takes place, some of the dominoes will no 
longer be inside the Aztec diamond of order n — 1, but all will lie inside the 
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concentric Aztec diamond of order n. It is shown in [4] that no dominoes 
overlap after destruction and shding have taken place, and that moreover the 
resulting configuration of dominoes, viewed as a partial tiling of the Aztec 
diamond of order n concentric with the original diamond of order n — 1, has 
a complement that can be tiled by 2-by-2 squares in exactly one way. In the 
creation step, we fill all these 2-by-2 holes with good 2-by-2 blocks, using 
a fair coin to decide whether the two tiles in any particular hole should be 
horizontal or vertical. The result is a complete domino tiling of the Aztec 
diamond of order n. Moreover, all the arrows in the new tiling are properly 
assigned in preparation for another round of shuffling. (Note that the dotted 
and undotted vertices change places, as is fitting, since we need the middle 
vertex on the upper border of the new Aztec diamond to be dotted when 
we apply the shuffling algorithm.) Figure 3 shows the results of applying 
destruction and sliding to the tiling shown in Figure 2; the untiled portion of 
the Aztec diamond of order n has been divided into 2-by-2 holes in the only 
way possible. When one performs creation, the empty 2-by-2 holes get filled 
in with good blocks, and all the good blocks in the tiling of order n arise in 
this way. 

Shuffling can also be run in reverse: given a tiling of the Aztec diamond 
of order n, one can remove all the good 2-by-2 blocks, slide each remaining 
tile one step in the direction opposite to its arrow, and then fill the 2-by-2 
holes in the resulting partial tiling of the Aztec diamond of order n — 1 using 
bad blocks (each composed of either horizontal or vertical tiles) . We are not 
interested in the reverse algorithm per se; rather, we will treat it as a way 
of seeing which tilings of the smaller diamond can give rise to some specified 
tiling of the larger diamond under the forward procedure. Specifically, if T' 
is a domino tiling of the Aztec diamond of order n with k good 2-by-2 blocks, 
then there 2'^ tilings T of the Aztec diamond of order n — 1 that can give rise 
to T under shuffling. 

To see why iterated shuffling yields the uniform distribution on the set of 
tilings, note that if T is a domino tiling of the diamond of order n—1 that 
has k{T) bad blocks that get removed in the destruction step, then k{T) + n 
good blocks will need to be added in the creation step in order to yield a 
net increase in area of 2n{n + 1) — 2(n — l)n — An. Thus, T gives rise to 
any of 2^^'^'>^^ different tilings T' of the diamond of order n, each with equal 
probability, namely 2"*^^*^"^^+"^ On the other hand, as remarked above, each 
such T' can arise from 2^*^^^ different tilings T. Hence, if we assume (for 
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purposes of induction) that each tihng T of the Aztec diamond of order n — 1 
arises with probabihty 2~("~^)"/^ after n — 1 rounds of shuffling, then after n 
stages, each tihng T' of the Aztec diamond of order n occurs with probabihty 



This completes the verification that the shuffling process indeed generates a 
uniformly random tiling. 

3. Reduction. 

To begin, let us check that the arctic region in a domino tiling of an Aztec 
diamond must have a fairly special sort of shape; specifically, the centers of 
its constituent dominoes must form a Ferrers diagram (rotated so that its 
origin is at the north corner of the Aztec diamond). 

We will do this by giving an alternative characterization of the arctic 
region. Imagine (to make subsequent discussions simpler) that we augment 
the picture by adding some extra dominoes external to the Aztec diamond, 
as shown in Figure 4. Specifically, we fiank each of the first n — 1 rows of 
the Aztec diamond by a horizontal domino on both the left and the right, 
and we put two extra horizontal dominoes immediately above the first row 
of the Aztec diamond, with one more horizontal domino immediately above 
those two. Call these 2n + 1 dominoes the external dominoes, and say 
that they, together with the tiles inside the Aztec diamond, constitute the 
augmented tiling. Now let us shade in the external dominoes, and, in 
each successive row of the Aztec diamond (starting with the first) let us 
shade in every north-going domino that is adjacent to two already-shaded-in 
north-going dominoes in the preceding row. 

We claim that no unshaded north-going domino can share an edge with 
the shaded region. For, suppose the (north-going) domino s in the shaded 
region shares an edge with the unshaded north-going domino u inside the 
Aztec diamond. Let s be the northmost domino with this property. For 
simplicity, we treat only the situation in which s is one of the tiles in our 
original tiling (the argument is similar, but slightly simpler, when s is one 
of the external dominoes). There are three cases, according to whether u 
is in the row above s, the same row as s, or the row below s. The first 
case (shown in Figure 5(a)) is easily disposed of; since s is shaded, both 




■n(n+l)/2 
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dominoes covering it in the preceding row must be shaded, contradicting the 
fact that u is unshaded. In the second case (shown in Figure 5(b)), the fact 
that s is shaded imphes that the domino r that covers both s and u must 
be shaded. But then s is not the northernmost shaded domino in contact 
with an unshaded north-going domino (as r now has this property too). In 
the third case (shown in Figure 5(c)), the fact that s is shaded implies that 
the domino r must be shaded. But then domino t must be in the augmented 
tiling as well. Since u is unshaded, and s is shaded, t must be unshaded; but 
then ,s" is not the northernmost shaded domino in contact with an unshaded 
north-going domino (as r now has this property too). 

It follows that the shaded region does not share an edge with any un- 
shaded north-going dominoes. (One can also see that it does not share a 
corner with any unshaded north-going dominoes either, since north-going 
dominoes never meet corner-to-corner.) Therefore the part of the shaded 
region that lies inside the Aztec diamond coincides with the arctic region, 
defined earlier as the union of the north-going dominoes that percolate to 
the boundary. 

The shaded region in the augmented tiling has the property that each 
shaded domino in the original tiling is covered by two shaded dominoes in 
the row above it, each of which belongs to the augmented tiling. Assign 
coordinates to the centers of the north-going dominoes in accordance with 
a rotated rectangular coordinate system, so that the northernmost external 
domino has center (0, 0) and the two below it have centers (1,0) and (0, 1) 
(going from left to right). Then the centers of the shaded dominoes are 
represented by a subset S of [0,n] x [0,n] that contains (0, A;) and (A;, 0) for 
all < A: < n and has the property that for all G S with 1 < i, j < n, 
both (i — 1, j) and {i, j — 1) are in S. But such sets S are precisely the 
Ferrers diagram of a partition with at most n parts and with largest part at 
most n. 

Now let us see how the arctic region changes under shuffling. No domino 
in the arctic region belongs to a bad block. Thus, north-going dominoes in 
the arctic region slide en masse to form a neighborhood of the "north pole" 
in ever-larger Aztec diamonds, and remain part of the arctic region forever. 
The question is, how do new dominoes get added to the arctic region over 
time? No domino can join the arctic region by sliding; the only way the 
arctic region can grow is by creation of good 2-by-2 blocks. 

To apply shuffling to augmented tilings, one simply applies shuffling to 
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the part of the tihng inside the Aztec diamond, allows the external tiles to 
slide upward, and creates two new external tiles. 

Now consider two horizontally-adjacent north-going dominoes in the aug- 
mented arctic region, as in Figure 6(a). After sliding takes place, the two 
dominoes must be positioned as shown in Figure 6(b). Note that the cells 
marked a and h must be unoccupied at this point, since cast-, west-, and 
south-going dominoes from the old (un-slid) tiling would not slide up that 
high. Hence the cells marked a and h must (in accordance with the results 
proved in [4]) be part of an empty 2-by-2 block (of the kind that becomes a 
good block when it is tiled), namely, the 2-by-2 block shown in Figure 6(b), 
consisting of the cells marked a, 6, c, and d. In the creation step, there is 
a I chance that this empty block will be tiled by two horizontal dominoes, 
in which case a new domino will get added to the arctic region, and there is 
a ^ chance that the empty block will be tiled by two vertical dominoes, in 
which case a new domino does not get added to the arctic region at that lo- 
cation. In terms of the Ferrers diagram, what is happening is that whenever 
(^, j) ^ S with {i — 1, j), — 1) G 5, the node (i, j) gets added to S with 
probability |. 

Henceforth, let us disregard the external dominoes, since they were only 
an aid to analysis and play no further role in the proof. 

It follows from the above argument, via induction on n, that after n 
rounds of shuffling the Ferrers diagram associated with the arctic region 
contains only nodes (i,j) with i -\- j < n\ that is, it sits inside the Ferrers 
diagram of the partition (n, n — 1, 1) (which is associated with the all- 
horizontals tiling). Each Ferrers diagram that sits inside (n, 1) is in fact a 
possible shape of the arctic region, though the different possibilities are not 
equally likely. 

Let us represent the arctic region by a Young diagram rather than by 
a Ferrers diagram. That is, the ith domino-position in the A;th row of the 

arctic region now corresponds to the unit grid-square with lower-left corner 
{k — i,i — 1). For convenience, imagine that the second, third, and fourth 
quadrants are all adjoined to the Young diagram (see Figure 7); call this 
the augmented Young diagram. Note that the Young diagram in the 
figure corresponds to the arctic region shown in Figure 4. Call a square 
that is not in the augmented Young diagram a "growth-square" if both its 
leftward and downward neighbors are in the Young diagram. Growth-squares 
in Figure 7 are marked by dots. 
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The growth-process for Ferrers diagrams (which represents the growth- 
process for possible shapes of the arctic region) is tantamount to the growth 
process for Young diagrams whereby, at each stage, each of the growth- 
squares joins the growing diagram independently with probability |. Fig- 
ure 8 illustrates this by giving the transition probabilities for all Young di- 
agrams with three or fewer boxes. We have omitted the trivial transitions 
in which a Young diagram gives rise to itself, so the outgoing probabilities 
shown do not sum to 1. Note also that the two two-box diagrams admit 
transitions to four-box diagrams that are not shown. 

A further transformation comes from looking at the boundary of the aug- 
mented Young diagram. This is a lattice-path with "endpoints" (0, +oo) and 
(+cxD,0), composed of unit steps downward and to the right. The path ini- 
tially runs straight from (0, -|-oo) to (0, 0) and straight from (0, 0) to (+oo, 0). 
As the tiling undergoes iterated shuffling, the lattice-path evolves in accor- 
dance with the rule that, whenever a downward step is followed by a right- 
ward step, there is a probability of | that "down-then-right" will be replaced 
by "right-then-down" at the next stage, with all such modifications being 
independent of one another. 

We claim that after n rounds of this evolution, the lattice-path L will, 
with probability 1 — o(l), stay within distance o{n) of a single particular 
path, namely, the curvilinear path P that goes from (0, +oo) to (0, ^) along 
a straight line, from (0, |) to (|, 0) along a quarter-circular arc with its center 
at (|, |), and then from (|, 0) to {+oo, 0) along a straight hne. This is just 
a restatement of the main theorem. 

To prove the restatement, it will suffice to prove that, for all slopes m, 
< m < oo, the intersection of L with the line y = mx will be within o(n) 
of the intersection of P with the line y = mx, with probability 1 — o(l). 
For, by choosing a large but finite number of such slopes that are suitably 
distributed between and oo, we will be able to make sure that P and L 
stay uniformly close merely by insuring that they are close at these check- 
points (this argument makes use of the fact that L can only go rightward or 
downward) . 

As a final restatement, let us associate each lattice-path L with a doubly 
infinite string of O's and I's whose ith element (i e Z) is a or a 1 according 

to whether the unique step in L from L n {{x,y) : x — y = i — 1} to 
Ln {{x,y) : X — y = i} goes rightward or downward. Then the operation of 
changing a down-then- right jag into a right-then-down jag in the lattice-path 
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L corresponds to the operation of replacing an occurrence of the substring 
"10" by the substring "01". If we interpret a 1 in position i as indicating the 
presence of a particle at the ith site in a one-dimensional lattice, and a as 
indicating a vacancy there, then this is just a jump-event in an asymmetric 
exclusion process. Figure 9 shows the lattice path of Figure 7 (tilted by 
45 degrees for convenience) and the associated configuration of the particle 
process. 

Transition mechanisms of this kind were studied by Rost [7]. However, 
Rost's dynamics occur in continuous time, while ours take place in discrete 
time. We will thus be able to avail ourselves of Rost's general approach, but 
there will be some differences in the analysis. In particular, the final results 
will not be the same: we will obtain an arc of a circle as the shape governing 
the system's asymptotic behavior, where Rost obtains an arc of a parabola. 

To conclude the link between Theorem 1 and 2, we must show that the 
asymptotic circularity of the lattice-path L described in the preceding para- 
graphs is implied by the formula in Theorem 2. To this end, fix < m < cxo 
and consider a lattice-point {x, y) with y = mx. The assertion that {x, y) 
is on the lattice-path L at time n is equivalent to the assertion that in the 
corresponding state of the particle process at time n, there are y particles to 
the right of position x—y. If the density of particles is as given by Theorem 2, 
then, putting 13 = \ and a = {x — y)/n, we find that the number of particles 

to the right of position x — y is n(^-^ — Equating this with y 

and simplifying, we obtain the relation [x — + (?/ — f )^ = X' describing 
a circle of radius | centered on (f , f ). 

4. The Exclusion Process. 
4.1. Markov measures. 

The random variable Xj is defined as the real-valued function on X = 
{0, 1}^ that sends (...a;_i, Xq, ...) to Xj. We begin by proving that for all 
< d < 1 there is a unique stationary Markov measure ii = such that 

Pi = d 

and 

goi^io = 2goo?ii, 



12 



where 

Pi = Prob^[Xo = i] 

and 

Qij = Prob^[Xi = j\Xo = i]. 

For, a general shift-invariant Markov measure on X is uniquely determined 
by Qqo and qu (since goi = 1 — Qoo and qio — 1 — qu), and the relation 
qoiqio = 2goo?ii yields 

(1 - goo)(l - qu) = 2googii, 
which can be solved for qu: 

1 - Qoo 
Qu = T— ■ 

1 + 900 

So all that remains is to show that the condition pi = d determines a unique 
value of qn. But note that 

Pi = Poqoi+Piqu 
= (1 -pi)goi 

= goi - goi) 

so 

1 - ?ii + goi 

1 + 2goo - ?oo 

It's easy to check that this makes pi a strictly decreasing function of qoo that 
goes from 1 to as qoo goes from to 1. Hence, for each d with < d < 1 
there is exactly one value of goo that yields pi — d, namely 

-d + ^d^ + (1 - d)^ 
1-d 

(when d = 1 this expression is to be interpreted as its limiting value, namely 
0). We will henceforth restrict our attention to the case < d < 1, since no 
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and are trivial. Note that poqoi = PiQio, because the frequencies of the 
strings 0, 1 and 1, must be equaL 

Why this particular definition of //d? Imagine a finite version of the parti- 
cle process introduced earlier, in which the infinite line is replaced by a circle 
of n sites, k of which are occupied by particles. The update-rule is asymmet- 
ric as before: particles may only advance "to the right" (counterclockwise, 
say). The process can be modeled as a Markov chain with (^1^ states, or as 

a random walk on a directed graph with (^1^ nodes. If a configuration of the 
finite particle process has i instances of a particle with a vacancy to its right, 
then the corresponding node of the directed graph has 2' outgoing arcs. On 
the other hand, such a configuration also will have i instances of a vacancy 
with a particle to its right, so the corresponding node of the directed graph 
has 2* incoming arcs. It follows from this that a steady-state distribution on 
the set of nodes is given by a probability measure that assigns to each node a 
probability proportional to 2*. One can show that if one sends k,n to infinity 
with k/n converging to some limit d, then the statistics of these measures on 
circular words converge weakly to the measure ^d- 

Now we wish to show that n = iidis invariant under the update-dynamics. 
That is, we must show that if we choose x = {...,X-i,xq,xi, ...) E X in 
accordance with the distribution n and then evolve the system through one 
time-step to obtain a new doubly- infinite sequence x' G X, the probability 
of the event x' E B (denote this by ^'{B)) should be equal to the probability 
of the event x E B (which is just n{B)), for all measurable events B d X . 
To prove this, it will suffice to prove n'{B) = ii{B) for cylinder sets B of the 
form 

5 = e A" : xo = 6o = 0, xi = hi, X2 = 62, = 6„ = 1} 

with 61, 62, ^'n-i in {0,1}- To see why this suffices, note first that the 

translation-invariance of and of the update-dynamics guarantees that fi' 
must be translation-invariant. Using this fact, combined with finite addi- 
tivity, one can show that if fi'{B) = ii{B) for all the special sets B just 
described, then /i'(-B) = /x(i?) for all cylinder sets B corresponding to bit 
strings that contain somewhere a followed by a 1. However, it is easy to 
see that ix' must assign measure to the set of infinite strings that nowhere 
contain a followed by a 1. Therefore, using countable additivity, one can 
prove ij!{B) = n{B) for all cylinder sets corresponding to finite bit-strings. 
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It follows from this that /i' agrees with /i on all cylinder sets and hence on 
the entire measure-algebra of Borel sets. 

Let N{B) > 1 be the number of occurrences of the substring 0,1 in the 
bit-string b = (60, &n)- There are 2^*^^^ different bit-strings a = (oq, 
that can be formed by replacing none, some, or all of these substrings by 
1,0. A point X E X can evolve in one time-step into a point x' E B if and 
only if (xi, Xn) is one of these 2" bit-strings. Let A = A{a) denote the set 
of X E X satisfying xq = ao, Xn = an, and let M{A) denote the number 
of occurrences of (1,0) in (ao,...,an), Then the probability that an x in A 
will evolve in one time-step into an x' in B is equal to 2"^^*^"^-' times two 
correction factors r{A) and s{A) associated with the beginning and end of 
the string a, respectively. Specifically, if = 1, then the probability that 
this 1 remains a 1 is gn -|- ^gio, which equals 1/(1 -|- goo)- Thus we define 
s{A) to be 1/(1 -|- goo) if ctn = 1 and to be 1 if a„ = 0. Similarly, if ao = 
the probability that the remains a can be shown to be ^(1 + goo), so we 
define r{A) to be |(1 -|- goo) if ao = and to be 1 if ao = 1. Then 

/i'(i?) = EM^) 2-^^(^V(A).(A), 

A 

where the sum is over the 2^*^^^ distinct cylinder sets A associated with the 
bit-strings a described above. 

To prove that fi'{B) = fi{B), it will suffice to prove that each of the 2^(^) 
summands is equal to 2^^'^^^ jj,(B). That is, we will show that every A that 
contributes to the sum satisfies i^{A)/n{B) = 2*^(^)-^(-^V^(^)s(^)- We do 
this by induction on the number of swaps required to turn B into A. In the 
case of swaps {A = B), the formula is true, since M{A) = N{B) — 1 and 
r{A)s{A) = |(1 -|- goo) •!/(! + loo) = |- To get the induction step, we need 
to show that 

Ai(i)//i(A) = 2^^(^)-^(^V(A)s(A)/r(i)s(i) (1) 

when A and A differ only in a single swap (that is, A has 1,0 in two adjacent 
positions in which A has 0,1). 

= 1 is a special case. In this circumstance, we have a = (0, 1) and 
a = (1,0), with fi{A)/fi{A) = 1 on general principles. On the right hand 
side of (0) we get M(i) - M{A) = 1, r{A) = i(l + goo), s(A) = 1/(1 + goo), 
r(A) = 1, and s{A) = 1, so the right hand side of (|l]) is equal to 

(2i)(i(l + goo))(l/(l + goo)) = l. 
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This verifies in the case n = 1; henceforth, we assume n > 1. 

Let us assume that the swap occurs in positions i and i+ 1, with < i < 
i + 1 < n. 

Suppose first that i > and i + 1 < n, so that r(A) = r{A) and s{A) = 
s{A). Then there are four cases to consider, according to the values of aj_i 
and ai+2- If di-i = and aj+2 = 0, then M{A) = M{A). To find the ratio of 
and /i(v4), write 

and 

= Pao<?aoai " •• gOO<?OigiO- • 

The two products are just re-arrangements of one another, so fi{A) / fi{A) = 
1. If ai_i = and a^+s = 1, then M(i) = M{A) + 1 and = 
?oi?io/?oo9ii = 2. If aj_i = 1 and = 0, then M(A) = M{A) — 1 and 
fi{A)/fi{A) = quQml QoiQiQ = |- Lastly, if a^.i = 1 and a^+a = 1, then 
M(i) = M(v4) and /i(i)//i(v4) = L In all four cases, (|l]) is verified. 

When i = 0, there are two cases to consider, according to the value of 02- 
If a2 = 0, fi{A)/fi{A) = piqwqoo/poqoiqio- Using the fact that piqw = poqoi, 
this becomes qoo/qio, which equals r{A). Since M(A) = M{A) and r(A) = 1 
and s{A) = s{A), (|I]) holds. If 02 = 1, fi{A)/n{A) = Piqwqoi / Poqoiqii = 
qo^/qu = 1 + goo = 2r{A). Since M(i) = 1 + M{A) and r(i) = 1 and 
= (p holds. 

When i = n — 1, there are two cases to consider, according to the value 
of a„_2. If a„_2 = 0, fi{A)/fi{A) = gio/goo = 2/(1 + goo) = 2s(A). Since 
M(i) = 1 + M{A) and = 1 and r(i) = r(A), (P holds. If a„_2 = 1, 
= gn/goi = 1/(1 + goo) = s(A). Since M(i) = M(A) and 
s{A) = 1 and r(i) = r{A), (g) holds. 

This completes the proof of (|l|), which completes the proof that fi'{B) = 
IJi>{B) for all our special cylinder sets B. As remarked earlier, this suffices to 
establish that /i' = yu; that is, /i = /x^ is invariant under our evolution rules. 

To conclude, we note for later purposes that /i(xo = 1, a;i = 0) = yu(xo = 

0, xi = 1) = pogoi = 1 - ^<P + {i-dY. 
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4.2. Uniqueness. 

In the previous sub-section we found a one-parameter family of translation- 
invariant, dynamically-stationary probability measures /id- As remarked ear- 
lier, the translation-invariant, dynamically-stationary measures form a com- 
pact convex subset of the compact space We will show that the only 
extremal points of this subset are the measures /x^- This will imply that every 
translation-invariant stationary measure is a convex combination of the /idS. 

Let // be an extremal measure in the set of translation- invariant, dynamic- 
ally-stationary measures, with Prob^[Xo = 1] = d. We must show that 
fi = fJd- Let A4fj__fj_^{X X X) be the space of translation-invariant probability 
measures on X'^^^ x X^'^^ that project to fi on X^^^ and fid on X^'^\ Note that 
A1(U,(Ud('^^'^^ ^ A"^^^) is non-empty, since in particular x /x^ is in it. Let 

S = inf {Prob,[x(^) ^ : tt G M,,,,{X('^ x X^'^)}. 

M.^,^^ is compact in its weak topology, so the continuous function tt i— > 
Prober [-'^o^^ 7^ -'^o^^] achieves its minimum value at some particular tt. Hence- 
forth, TT will denote just such a discrepancy-minimizing measure. Our strat- 
egy will be to show that if 5 > 0, then we can find another joining tt' with 
smaller discrepancy, contradicting the definition of 5. Our argument will be 
symmetrical in X^^^ and X^'^\ 

To construct tt' from tt, we define dynamics on A'^^^ x X^'^^ as follows. Fix 
{x^^\x^'^^) e X^^^ X X^'^\ Say that the spatial indices i and i + 1 are linked 
(relative to {x'^^\ x'^'^'>)) if either of the doubly-infinite strings x^^\ x^"^^ has a 1 
in the ith position followed by a in the i + 1st position. If positions i, i + 1, 
i + 2, j — 1, and j are pairwise linked each to the next, then we say that 
i,i + 1, j are in the same block. In this way, Z is divided into blocks, and 
{x^^\x^'^^) is divided into sub- words (which we will also call blocks), each of 
which consists of k consecutive symbols from x'^^^ and the corresponding k 
symbols from x^'^\ for some k. (It's easy to show that with probability 1 the 
blocks are finite, though in the context of our proof we don't need to worry 
about there being blocks of infinite length.) 

For example: If {x'^^\x^'^^) is 

... 1 1 1 1 1 1 1 ... 
... 1 1 1 ... 
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(with top and bottom rows corresponding to x^^^ and x^'^\ respectively), then 
the blocks are 

... 110 1 10 1 10 1 ... 

... 1 1 1 ... ■ 

To describe the update-dynamics, one need only specify how each block is to 
be updated. 

If a block is of length 1, there is nothing to do. If a block is of length 2, 
then one has no choice about how to couple the two updatc-processes, unless 
the block is of the form 

1 
1 0. 

In this case, we decree that one should use the same random bit to decide 
what to do with the upper substring (from x^^^) and the lower substring 
(from x^"^^). In particular, with probability | the block stays the same, and 
with probability | it becomes 

1 
1. 

For blocks of length 3 or more, which are of the form 

1 1 ... 1 
? 1 1 ... ? 

(or minor variations thereof), one must do something slightly different: we 
decree that in deciding about the ith occurrence of "1 0" in the upper sub- 
string and the ith occurrence of "1 0" in the lower substring, one should use 
not the same bit but complementary bits. That is to say, one should convert 
the ith occurrence of "1 0" in the x^^^-iow of the block to a "0 1" if and only 
if one leaves the ith occurrence of "1 0" in the x^^^-row of the block alone. In 
all other respects one's random choices are to be independent of one another. 

Define a mismatch as a position in which the x^^^ and x^'^^ words disagree. 
If a block has length n > 2, the number of mismatches goes from at least 
n — 2 to at most [(n — l)/2j. This implies that in each finite block, the 
number of mismatches cannot increase. Indeed, in certain kinds of blocks 
(call them "unstable blocks") the number of mismatches will go down with 
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positive probability. The only blocks that do not have this property — the 
"stable blocks," as we'll call them — arc the blocks of length 1 and the blocks 

10 11 00 110 

10, 10, 10, 100 

(and the blocks obtained from them by switching the roles of x^^^ and x^"^^). 

If we let TTn be the probability measure describing the outcome of applying 
the joint dynamics to tt for n time-steps, and we let A = {{x^^\ x^"^^) : 
Xq^"* = x'^Q ^}, then (making use of the shift-invariance of the 7r„'s) we have 
7r(A) > 7ri(A) > 7r2(A) > ... . Since n was chosen to minimize 7r(A) = 6, we 
must in fact have 7r(A) = 7ri(A) = 7r2(A) = ... . This means that unstable 
blocks have probability zero under tt. 

Now, we have 

S = 7r(A) = PtoK[X^^^ = 1, X^^^ = 0] + Prob^[xi') = 0, X^'^ = 1]. 

But 

Prob^[xi^^ = 1, X^^^ = 0] = Prob^[xi^^ = 1] - Prob^fX^^^ 

= d - Prob^[xi^^ = 1, X^^^ = 1] 
= Prob^[xi^^ = 1] - Prob,[xi^) 
= Prob,[xi') = 0, Xi'^ = l]. 

Hence Prob^[X,S^^ = 1, X^^^ = 0] = Prob^[X^^^ = 0, X^^^ = 1] = S/2. 
We claim that this implies that some event of the form X-^^ — 1, xf^"* = 
0, Xj^^ = 0, xj^"* = 1 must have positive probability. For, were this not 
the case, tt would assign probability 1 to U 3^2, where = {(x^^\ x^'^^) : 
Xi^^ > xf'^ for all i} and 3^2 = {(a;*^"^\ x^^-*) : x\^^ < xf^ for all i}. Notice, 
however, that each of these two sets is translation-invariant and is mapped 
into itself by our evolution-rules; if yi and 3^2 each had positive probability, 
then by restricting tt to each of them in turn, we would get two distinct 
measures whose non-trivial weighted average was tt, contradicting the ex- 
tremality of tt. Hence one of the two sets would have to have measure zero. 
But this contradicts the earher- proved fact that Prob^fXQ^-* = 1, X^^^ = 0] 
and Prob^fXQ^-* — 0, X^^^ = 1] are both positive. Hence, as asserted, there 



= 1, ^ = 1] 
= 1, ' = 1] 
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exist i,j such that PToh[Xy ' = 1, X> ' = 0, X} ' = 0, Xf = 1] is positive. 
Without loss of generahty, assume i < j. 

Consider now a two-rowed pattern of length j — i + 1 oi the form 

1 ... 
... 1 

for some fixed (but unconstrained) values of the intervening bits, occurring 
with positive probability under vr. We may assume without loss of generality 
that there are no discrepancies between the respective intervening bits, for if 
this is not the case we can take j — i smaller. Let A be the set of x*^^)) G 
X X X with this pattern at positions i through j. By hypothesis, n{A) > 0. 

Pick {x^^\x'^'^^) E A randomly, in accordance with the measure tt. There is 
a positive probability that in some finite number of steps the intervening bits 
will sort themselves so that the I's are as far to the right as possible while the 
O's are as far to the left as possible. At this point, assuming j — i > 2, either 
the 1 on the left boundary can move to the right (while the other positions 
are unaffected) or the 1 on the right boundary can move to the left (while 
the other positions are unaffected). This reduces the distance between the 
discrepancies by 1. Iterating this until we get down to j — i = 1, we see that 
there exists some n and some i for which Prob7r„[Xj*^^^ = 1, xf^-* = 0, X^^^ = 
0, = 1] > 0. However, such an i would belong either to a block of the 

form 

1 
1 

or to a block of length > 3, and this unstable block would lead to Prob7r„+i (A) 
< Prob^^(A) = 6, contradicting the minimality of 6. Therefore 6 = and 
II = /Id, as claimed. 

This completes the proof of Theorem 3. 

We remark that the preceding demonstration is very similar to the one 
expounded in sections 2 and 3 of chapter VIII of [6] in the case of continuous 
time. However, we have made the proof slightly easier by invoking compact- 
ness in order to choose a discrepancy-minimizing tt at the outset (since we 
then need only show that we can decrease the discrepancy further, rather 
than show that we can reduce it all the way to zero). Also, we have coupled 
/J, directly with rather than with /id' for d' close to d. 
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5. Monotonicity Properties of Particle Density. 



Before we dive into the technicahties that will ultimately afford us a 
rigorous proof of Theorems 1, 2, and 3, we offer a heuristic reason for the 
circularity of the temperate zone, in terms of the behavior of the particle 

process. 

We will assume heuristically that short excerpts from the lattice-path 
exhibit statistics that are nearly Markovian and are governed by some par- 
ticular measure fj,d, where d is not constant but is a slowly varying function 
of position. 

If one examines a piece of the lattice-path that is approximately governed 
by some particular na, one sees that the path approximates a line of slope 
s = — 2i = — Write this hne as y — sx + b. If every point on this hne 
moved 1 unit to the right and 1 unit upward, the equation of the line would 
become y — 1 = s{x—l) + b or y = sx + b+{l — s); that is, the line would move 
upward by 1 — s. Similarly, if every point on the lattice-path moved 1 unit to 
the right and 1 unit upward, the straight line that locally approximates the 
path would appear to slide 1—s units upward. However, when we perform the 
stochastic lattice-path update, only some of the points get moved, namely, a 
proportion of ^piqio (where piqio is the frequency of down-then-right bends, 
and I is the probability that a given down-then-right bend will become a 
right-then-down bend). Hence the lattice-path is expected to move upward a 
mean distance of only |pigio-(l— -s). Butpi^io = PoQoi and 1—s = — 
so the expected vertical displacement is 

1 i_i _ii-^J{i-dy + d^ _{l-s) + VT+^ 

t; PoQoi ■ — — 7;%! — t; ^ — i — 7, ■ 

2 po2 2 1 — a 2 

Now let us assume that when n is large, the lattice-path is an approx- 
imation to some continuous curve ^ = (f){^), where satisfies boundary 
conditions 0(0) = | and 0(|) = 0. Note that under this assumption, the 
lattice-path in the vicinity of the point {X, Y) (with ^ = 4>{^)) should drift 
upward by approximately 

(n+l)0(;^)-n0(f) ^ (n+l)(0(f)-(f-^)0'(f))-n0(f) 

= 0(f)-f0'(f) 

n n I \ n / 
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Putting X = —, y = (j){x) = |, and (/)'{—) = ^ = and equating the two 
drift rates, wc get the differential equation 

dy + + 

y-'^d^^ 2 

for < X < |. It is easy to cfieck tfiat y — ^ + \Jx — is a solution; but 
this is just a quarter-circular arc of the circle {x — + (y — = \- 

It would be satisfying if the preceding heuristic argument could be made 
rigorous; however, we do not know of any general lemmas that would justify 
the approach. Instead, we have resorted to the technique that Rost developed 
in his analysis of the continuous-time exclusion process [7]. In the present 
section we are mainly interested in the limiting behaviour of the X-process 
as one moves out from the origin along a space-time line of slope m, i.e., 
the pictures seen near location un at time n. To study this, we introduce 
a function /i(-) with the property that the number of particles to the right 
of position k at time n is roughly nh{k/n). We then show that h exists 
almost everywhere and is a convex (hence almost everywhere differentiable) 
function of u (Proposition 2). We further show that the derivative exists, 
and is essentially the negative of the probability of finding a "1" at locations 
near un at time n, where n is large (Proposition 3). We then show that 
at a fixed time, the local statistics change monotonely as one moves to the 
right, in the sense that any given pattern of I's is less hkely to appear the 
further one goes to the right (Proposition 4). This result is used to show 
that the limiting behaviour mentioned above is almost everywhere a convex 
combination of the update- and translation-invariant measures studied in the 
previous section (Proposition 5). Finally, we show that these local statistics 
depend in a continuous way on the speed u of the "observer" (Proposition 
6). 

In the section following this one, we obtain a formula for h by means of 
two separate arguments, which give upper and lower bounds for h that turn 
out to coincide. We obtain a lower bound on h by slowing down the lead 
particle, which clearly cannot increase the function h. We obtain an upper 
bound by looking at how fast the "l"s can move when they have a given 
density; the essential idea is that at location un and time n, an upper bound 
on this "average velocity" can be calculated and eventually leads to an upper 
bound on h{u). 
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We have made no attempt to hide our indebtedness to Rost's work; whole 
paragraphs have been hfted from his article with only minor modifications. 
We have done this because we do not see room for many improvements 
in Rost's exposition, and because we wanted our presentation to be self- 
contained. 

As in [7], we will let X be our particle process with state space X = 
{0, 1}"^, except that the updates only occur at discrete moments indexed by 
N. X{k, n) is the state of position k at time n, with /c in Z and n in N. The 
initial state {n — 0) is given by X{k, 0) — x%, where 



The particle initially at location will be called the lead particle. The 
order on points a;, y G A' defined by 



induces a stochastic order on the set of probability measures on X, namely 
^ < v \i and only if 7r({(a;, y) : x < y}) — 1 for some joining tt of and i/, 
where a joining of two probability measures on A" is a probability measure 
OTo. X X X with respective marginals /i and v. A /c-point correlation of ji 
(with /c > 1) is a quantity of the form ji{{x : Xi-^ = Xi^ = ... = Xi^, = 1}), 
with ii < 12 < ... < ik] a measure ii on X = {0, 1}^ is determined by its 
correlations [1]. 

Note that, in the specified initial state x* and in all states accessible from 
X* in finite time, there is a rightmost 1. We may therefore define the process 



with state space S equal to the set of all (weakly) decreasing sequences of 
non-negative integers, and we may further define a component-wise order on 
S and a stochastic order on the set of probability measures on S just as we 
did for X. 

We let C{Y) denote the probability law governing a random variable F, 
we let £{Y) denote the expected value of Y, and we let * denote convolution 
of probability measures on N. 




1 if A; < 0, 
iffc>0. 



X < y if and only if Xi < yi for alH e Z 



S{k,n)=^X{z,n) 



i>k 
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Proposition 1: For all non-negative integers m,n and integers k,l, one 

has 

jC{S{k, m)) * CiSil, n)) > J[:{S{k + l,m + n)). 

Proof: A simple coupling argument shows that the evolution-rule of the 
(S-process preserves stochastic order of measures on its state space S. (In 
the interpretation of particle-configurations as lattice-paths, one state of the 
^'-process is dominated by another if the lattice-path associated with the 
former never crosses above or to the right of the lattice-path associated with 
the latter.) Accordingly, we define a process S that is equal to S up until 
time m but at time m is replaced by 

^, . \ _ J S{k, m) for j > k and 

^ [J: = I ^^^^ m) + {k- j) for j < k; 

that is, all the particles that are to the left of position A; -|- 1 at time m 
simultaneously move as far to the right as possible, so that site k and all sites 
to its left become occupied. After time m, S again evolves in accordance with 
the dynamics of the S'-process. One sees that C{S{-, m+n)) > C{S{-, m+n)). 
But, conditioned on S{k,m), the law of S{k + l,m + n) — S{k,m) {I G Z, 
n > 0) is identical to that of S{l,n) (/ G Z, n > 0) and independent of 
S{k, m). Therefore C{S{k + l,m + n))= C{S{k, m)) * C{S{1, n)). □ 



Proposition 2: For all rt G R, the random variables ^S{[un\,ri) converge 
almost surely and in to a constant h{u) as n — > oo. The function h is 

decreasing and convex; h{u) = for u > ^ and h{u) = —u for m < — |. 

Proof: Proposition 1, coupled with the fact that \um\ + \un\ < \u{m + 
n)\ , gives us 

C{S{\um\ , m) * C{S{\un\ , n)) > >C(S'[M(m -|- n)\ , m + n)). 

The convergence statements in the Proposition follow from the Kesten-Ham- 
mersley theorem [8] . To prove convexity of h, we deduce from Proposition 1 
that for a, P > with a + (3 = 1, 

£S{ [aun\ , an) + £S{ [(3vn\ , (3n) > £S{ [{au + (3v)n\ , n) . 

Dividing both sides by n gives 

ah{u) + (3h{v) > h{au + (3v). 
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If M > |, we have h{u) = 0, as the lead particle moves to the right with mean 
speed |. Similarly, the leftmost vacancy moves to the left with mean speed 
— |, whence h{u) = —u for u < \. □ 



Proposition 3: If h is differentiate at u, then SX[k,n) —>■ —h'[u) when- 
ever n ^ oo with k/n u. 

Proof: We consider the functions defined by 

hn{v) = f £X{\wn\,n) dw] 

Jv 

note that hniy) is roughly equal to ^ YZ=\vn\ SX{k,n). The functions hn{-), 
in addition to being decreasing, are convex (see the first inequality in Proposi- 
tion 5.4) and tend to h{-) as n gets large, where h{v) = lim„^oo [vn\ , n). 
Hence, by an elementary lemma on convex functions of a real argument, 
the desired result holds. Indeed, more is true; as long as h'{u) exists, 
h'^{v) —>■ h'{u) as V —>■ u, where h'j^{v) may be either the right or left derivative 
of hr, at V. □ 



Define f{u) = —h'{u + 0), where h'{u + 0) signifies the right derivative 
of h at u. f{u) is our candidate for the density of the particle process at 
location [wnj at time n, when n is large. 

Let tJi{k^n) = C{X{k + l,n),l G Z). That is, ^{k,n) is the probability law 
governing ■), shifted k positions spatially and n steps into the future. 

Proposition 4: For all k E Z and m, n G N we have 

^{k,n) > fi{k + l,n), (2) 

fj,{k,n + m) < P{m,l)^{k — l,n), and (3) 
I 

fi{k,n + m) > P{m,l)fi{k + l,n), (4) 
I 

where (3{m, ■) is the binomial distribution with mean y and variance ^. 

Proof: n{k+l, n) is the law of X{k+l, n), Z G Z under the initial condition 
x', where x'^ is 1 or according to whether i<— lori>— 1; since x' < x*, 

follows from the monotonicity of the dynamics. 
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Monotonicity also plays a role in the proof of (|^). The position of the 
lead particle at time m is binomially distributed with mean conditioned 
on that position, one compares the original process with the process in which 
all sites behind the first particle are occupied at time m, and which evolves 
according to the usual dynamics after time m. (^) follows from (|]), by 
symmetry between migration of particles and migration of vacancies. □ 

Proposition 5: If h is differentiable at f , any weak limit /i* of the measures 
/i( \un\ , n), n — i> oo, is of the form 

^* = I /^i P{dt) 
Jo 

with p some probability distribution on [0,1], and with fit defined as in 
section 3 (there we called it /i^). 

Proof: n* is stochastically larger than its image under the shift, by Propo- 
sition 4. Since ^""-^ u asn ^ oo, for every fixed k, both /i* and its image 
under the shift have the same one-point correlations, namely f{u) (Propo- 
sition 3). It follows that fi* and its image under the shift are identical. 
But shift- invar iance of fi* implies also its invariance under the action of the 
time-evolution semigroup, if one uses the second and third inequalities of 
Proposition 4. Using the main result of the previous section, we arrive at the 
desired result. □ 

If F is a finite subset of Z, let p{k, F; n) be the probability that X{k + 
i,n) = 1 for all i E F. 

Proposition 6: Assume h is differentiable at u. For any finite set F of 
cardinality n and any e > there exists a 5 > and no G N such that 

\p{[un\,F;n) - p{[un\,F;n)\ < e (5) 

for |m — m| < 6 and n > hq, and 

|p( [un\ , F;n + m) — p{ \un\ ,F;n)\ < e (6) 

for < m < Sn and n > Uq. 
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Proof: First let us prove (^. By symmetry, it suffices to handle the case 
u>u. 

Since fJ,{k,n) is stochastically decreasing in k we need an upper estimate 
only for 

p{[un\,F;n) - p{[un\,F;n). (7) 
But the definition of stochastic order (via coupling) gives us such an estimate: 

^ (p( [u\ , {i}; n) - p( [u\ , {t}; n)) = [un\ +t,n)- X( [mj + t, n)). 

(8) 

Take 6 > such that h'{u + 6) exists and satisfies —h'{u + 5) = f{u + 5) > 
f{u) — ^ (recall that / is assumed continuous at u). Proposition 3 gives 

limM S{X{[{u + 6)n\ +i,n)> fiu) - ^. (9) 

Hence we have, uniformly for u < m + 5, 

limsup^ (p(LMnJ,{i};n) - p( [wnj , {i}; n)) < e/2. (10) 

n^oo 

This implies that (|^ becomes eventually smaller than e, uniformly in m < 
u + 5. 

This completes the proof of (^. The estimate (^ follows from (|^) in 
combination with Proposition 4, inequalities (^ and (|), and the fact that 
/3(s, •) is essentially carried by a set of the form {/ : / < cm} in the limit 
m oo. □ 



6. Identification of tlie Density Profile. 

6.1. Lower bound. 

Let Z{i,n) be the position at time n of the particle originally located at 
position —i, and for > 1 let Y{k,n) = Z{k — l,n) — Z{k,n), the distance 
between the A; — 1st and kth particles from the right at time n. Unfortunately, 
the expected value of Y{k,n) goes to infinity as n gets large, with k fixed, 
so that the law of large numbers cannot be applied as directly as we might 
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like. Rost was able to remedy this problem in a clever way, as we are about 
to see. 

Proposition 7: h{u) > — for \u\ < \. 

Proof: We modify the dynamics of the system as follows: when deciding 
whether to advance the lead particle or not, we use a biased coin, so that 
it advances with probability | with 6 < 1; the other particles advance with 
probability | as before. Expectations with respect to this process will be 
denoted by and its probability law will be denoted by P^. 

These dynamics, in terms of the F-process, may be described as follows: 
The state space is the set of all sequences of positive integers {yi,y2,...). 
Given such a sequence, let cti be a Bernoulli random variable with expected 
value |, and for i > 2 let ctj be a Bernoulli random variable of expected 
value I unless = 1, in which case let ccj be the constant 0. (We can 
think of «j as the indicator function of the event in which the ith particle 
moves to the right.) Then the F-proccss, when in state {yi,y2, ■■■), jumps to 
{yi + cti — ^2, 2/2 + tt2 — ct3, •••)• can check two statements about this 
process: first, it preserves stochastic order; second, an invariant measure is 
j^, defined by the properties that all its coordinates are independent and 
identically distributed and that 

f 1 ifm = 0, 

I Ha^j ifm>0. 

If we compare the modified F-process with initial condition 2/1 = 2/2 = 
... = 1 and the modified F-process with initial condition given by the sta- 
tionary measure 7'', we find that the law of the first process is stochastically 
smaller than the law of the second process at time n — and hence for all 
times. We thus get, for all A; > 1 and n > 0, 

{j:yij,n)] < k E ^\y, >m)^k . 

If we choose k — \_an\ , with < a < 1 fixed, and let n — > 00, we get 

1 / \ 2 - 6^ 

limsup -S^ ^0>) ^ « 7^ — ^ ■ 
n-00 n \^.<„„ j 2 -2b 
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In fact, using the weak law of large numbers for and stochastic domination, 
we find that for any e > 0, 



j<an 

as n — > cxD. For now, hold both e and a fixed. We know that Z{0,n) is 
binomially distributed with mean |, so the law of large numbers, in combi- 
nation with the preceding result and the fact that Z{[an\,n) — Z{0,n) — 
Ej<an^(j», gives 



-Z(LanJ,n)<--a^-e 



. 



The Z-process gets stochastically larger if b is replaced by 1; hence the 
preceding proposition remains true if we replace P'' by P, the probability law 
governing the original dyna mics of the Z-process. This holds for every b. In 
particular, setting b— 1 — ^j^, so that 



2 2-26 2 V V J 



b 

- — a 



(the maximum value of | — a^Zb ^ ^ ranges over [0, 1]), we get 
n 

P 



-Z{ [an} ,n) < - — a — \Ja{l — a) — e 
Expressing this in terms of the ^-process one obtains 
P 



0. 



is 

n 



1 

a 

2 



ail — a) — e] n 



n ] > a 



Since this is true for all e > 0, 



h \ - — a — \l a{l — a) I > a. 



— a — \ a 



Setting u 
\u\ < -5, as claimed. 



[1 — a), we conclude that h{u) > — — for 



□ 
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6.2. Upper bound. 



To complement Proposition 7, we have 



Proposition 8: h{u) < ^ ~ ly i ^ ''^'^ h^l ^ |- 

Proof: First assume u > 0, and put w = 1/u. Assume that u is irrational 
and that h'{u) exists (as must be the case for a dense set of ti's in (0, |)). 
We compute the expected value of S{lun\,n), which is the expected value 
of the number of particles that have passed an observer traveling at speed u, 
minus the number of particles that the observer has passed: 

SS{k,[kw\) = -Y.P[X{[iu\,i)^l, X{\iu],i)^0] 

k 

-J2£X{l,lw). 

1=1 

Multiplying both sides by u/Zc ~ l/[/cwJ and taking the limit, one gets 
fe-»oo \_kw\ 

1 1 ^-^ 

= 2Ji^J^^J;^Y.^^il'^^\^'^)i^o^l,Xl^o) -uf{u). 



=0 

The same argument shows that this formula holds for negative u, too. Thus 

.1 11 

2' 2J" 



this relation holds for a dense set of u's in [— 



Now, by Proposition 5, any limit of 

l^{lun\,n){xo = 1, xi = 0) 

as n — > oo is of the form 

j {1 — \J + [1 — a)^) p{da) with j a p{da) = /(a) 
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(see the formula for the probabihty of the event Xq — 1, Xi = calculated at 
the end of subsection 4.1). Hence by Jensen's inequality we get 

limsnpfx{[un\,n){xo = 1, xi = 0) < 1 - J {f (u))^ + {1 - f (u))^ 

n— >oo 

so that 

iV-l 

i lirn^ Y: l^{lm\,t){xo = 1, = 0) < i - + (1 - /H)' 

Hence 



N-*oo . _ 
1=0 



h{u) < 1 - i^(/(l.)2+(l-/(x.))2 - uf{u). 

Since < f{u) < 1, we have 

h{u)< sup \^^-iJb^ + {l-by-ub]. 

0<b<l I ^ J 

This is maximized at b — ^ — i~fi'^))- Substituting, we get h{u) < 

6.3. Conclusion of Proof. 

Propositions 2, 3, 7, and 8 combine to yield the following density profile 
and law of large numbers: For any u e R, £X{k, n) tends towards f{u) as n 
goes to infinity with k/n tending towards u. The function / is given by 



/(«) 



' 1 for M < — |, 

\ - VCT -\<u<\, and 
for M > \. 



The quantities \ Y.un<k<vn^{.ki'<^) converge almost surely to the constant 
value f{w) dw, for u < V. 

We can now unwind our results to obtain a proof of the Arctic Circle 
Theorem. Under a change in coordinates, the preceding result says that if 
we evolve an infinite lattice-path in the first quadrant in the fashion described 
near the end of section 3, the lattice-path at time n will almost surely attach 
to the X- and y-axes at points (| -|- o(n), 0) and (0, | -|- o(n)), and for 
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all < < ^, the lattice-path at time n will almost surely cross the line 
y/x = ta.n9 at a point (| cos 6' + o(n), |sin6' + o(n)). For each fixed e > 0, 
we can find angles < < ^2 < ■■■ < < f and a number 5 > so 
that any curve that starts at (0, | ± Sn), ends at (| ± Sn,0), only moves 
rightward and downward, and meets each line y/x = tan^j within distance 5 
of (| cos 6i, ^ sin 9i) (for all 1 < i < m) must necessarily stay within distance 
e of the quarter- circle x"^ + y"^ — x,y > 0. This concludes the proof of 
the Arctic Circle Theorem. 

It is also worth pointing out that the measures /id are in fact "attrac- 
tors" under the time-evolution map F : J^{X) if one restricts to 

ergodic translation-invariant measures on {0, 1}^ (that is, those translation- 
invariant measures that cannot be decomposed as convex combinations of 
other translation- invariant measures). For, let /x' be any such measure, with 
IJ,'{{x : Xq — 1}) = d, and take a measure ttq on X x X with marginals /id 
and 11' . Let tTj = ^(ttq) under the coupling-dynamics F : M.{X x X) ^ 
Ai{X X X). We have shown that VTj converges almost surely to a diagonal 
measure with projection Hd on each component; hence the TTj's converge in 
distribution, and in particular, F'^^ii') (the second marginal of tTj) converges 
to Hd- 

7. Introducing Bias. 

Domino tilings have been studied by researchers in statistical mechanics 
in another guise, namely, dimer-patterns on a square grid; see for instance 
[5]. Suppose that, as in Kasteleyn's paper, we introduce an energy function 
that discriminates between horizontal and vertical tiles. It is shown in [4] 
that the Aztec diamond of order n has tihngs with 2k horizontal 

tiles and n{n + 1) — 2k vertical tiles (and no tilings in which the number 
of horizontal tiles or vertical tiles is odd). Thus, if we fix < p < 1 and 
assign measure p''(l — p)"("-+i)/2-'t' each tiling of the Aztec diamond of 
order n with 2k horizontal tiles and n{n + 1) — 2k vertical tiles, we obtain 
a probability measure on the set of tilings that is a Gibbs state relative to 
an energy function that assigns energy — | logp to horizontal dominoes and 
— |log(l — p) to vertical dominoes. 

It is not hard to show that random domino tilings of Aztec diamonds 
under this energy function may be iteratively generated by "biased shuffling," 
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in which an empty 2-by-2 block is filled with two horizontal dominoes with 
probability p and two vertical dominoes with probability 1—p. The temperate 
zone can be defined as before, and simulation suggests that its boundary is 

2 2 

the ellipse — + ^ — 1. 

We can prove part of this assertion rigorously, using the methods intro- 
duced earlier. Each of the four growing "frozen" regions can be associated 
with a Ferrers-diagram (or Young-diagram) growth process. For the north 
and south frozen regions, the probability of growth is p; for the east and 
west, it is 1 — p. In all cases, the growth process may in turn be replaced 
by an asymmetric exclusion process, in which the probability of a particle 
moving to its right (assuming that there is a vacancy there) is equal to the 
growth-rate for the growth-process. 

As in the unbiased case, there exists (for each value of the bias p) a one- 
parameter family of extremal elements in the set of dynamically stationary, 
shift-invariant measures, where the parameter corresponds to density; more 
specifically, we get Markov measures on X = {0, 1}'^ with transition proba- 

bihties qoi = — ^2p(i-d) ' ^lo = — , ^oo = 1 - ^oi, Qn = I - Qw- 

Here p is the growth rate (or drift rate) and d is the density of I's. The 
heuristic method given at the beginning of section 5 can be used to lend 
support to our ellipse conjecture. If we knew that these were the only dy- 
namically stationary, shift-invariant measures, we would be able to deduce 
an "arctic ellipse theorem" . Unfortunately, when p exceeds one-half, the cou- 
pling method of section 4 does not work. Specifically, it is no longer possible 
to devise a coupling under which the number of mismatches is guaranteed to 
weakly decrease over time. One can see this by noting that if p is close to 1, 
then the block 

1 
1 1 

is very likely to give rise to the block 

1 

1 1 

at the next time-step, so that the number of mismatches within the block 
will increase from 1 to 3. If the initial state of the joint process is 

... 1 1 1 1 1 1 ... 
... 1 1 1 1 1 1 ... 
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then the density of mismatches will increase from 1/3 to nearly 1. 

It is nevertheless true that when p is less than one-half, the proof of The- 
orem 3 given above carries over in a straightforward way, showing that the 
Markov measures introduced in the preceding paragraph (with p fixed and d 
varying) are the only stationary translation-invariant probability measures. 
Hence, the north and south frozen regions arc indeed bounded by the pre- 
scribed arcs of ellipses. In the case where p is greater than one-half, the 
heuristic method described at the beginning of section 5 leads one to a dif- 
ferential equation that is satisfied by the ellipse, so we feel confident that this 
is in fact the correct answer. 

It is interesting to note that if one takes the limiting behavior of the 
biased exclusion process in discrete time as p — > 0, the elliptical arc tends in 
shape to a parabolic arc (once a renormalization is made to compensate for 
the fact that the arc is getting smaller) . This is reassuring, since the discrete- 
time exclusion process should "converge" to the continuous-time exclusion 
process as p ^ when time is re-scaled, and since (as was remarked earlier) 
the continuous-time process was shown by Rost to yield a parabolic arc as 
its asymptotic profile. 

8. Conclusion. 

We have shown that a random domino-tiling of a large Aztec diamond 
(unlike a random domino-tiling of a large square) is likely to have large- 
scale structure, and we have given precise information about this structure. 
Outside of a certain critical circle (which we call the arctic circle) , a random 
tiling is likely to exhibit four different sorts of local behavior, associated with 
four particular tilings of the plane. A natural next step would be to describe 
the behavior of the tiling inside the arctic circle. 

In work to be described elsewhere, Henry Cohn, Noam Elkies, and James 
Propp [3] have taken this step, and given a formula governing the behavior 
of random tilings inside the circle. One consequence of their formula is that 
regions within the arctic circle that are macroscopically separated (i.e., sepa- 
rated by distances whose ratio to n is bounded below) exhibit different local 
statistics under random tiling. Thus, the extreme homogeneity of a random 
tiling in the four regions outside of the arctic circle is in sharp contrast with 
the total non-homogeneity that prevails inside the circle. 
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Our proof that the boundary between these two domains is circular is 
more comphcated than we would like. However, if one is content with the 
more modest, qualitative goal of seeing that there must exist some non- 
homogeneity in the statistics of the tiling, then a simple explanation exists. 
Recall from [4] that domino tilings of the Aztec diamond are in bijection 
with certain "height-functions" — integer- vahicd functions on the set of lat- 
tice points internal to the Aztec diamond, satisfying certain local constraints 
and boundary conditions. The local constraints force the height-function to 
satisfy a Lipschitz condition, so that it cannot change too rapidly. Consider 
the real-valued function obtained by averaging all the height-functions that 
correspond to tilings (the "average height-function"). It too will satisfy a Lip- 
schitz condition. The difference in average height between two neighboring 
vertices is an easily-calculated function of the local tihng statistics, so that 
if the local statistics were to be homogeneous, the average height-function at 
lattice-points would have to be well-approximated by a linear function 
ai + bj + c for suitable constants a, b, c. However, one can check that the 
boundary conditions for Aztec diamond height-functions are not consistent 
with any single choice of values for a, b, c, because as one travels around 
the boundary of the Aztec diamond the height alternately increases and de- 
creases. 

If we were to replace the Aztec diamond by a square in the preceding 
analysis, we would find that the height-functions associated with tilings are 
all essentially constant on the boundary of the region, so that one could take 
a — b — and c arbitrary and get a good approximation along the boundary. 
Indeed, Burton and Pemantle's work [2] shows that random domino tilings 
of large squares are statistically homogeneous away from the boundary (in a 
suitable asymptotic sense). We believe that the statistics at the very center 
of the Aztec diamond converge in the limit to Burton-Pemantle statistics, 
but we do not have a proof of this. 

Wc would like to thank David Aldous and Persi Diaconis for helpful conversations. 
Thanks also to Sameera Iyengar, who wrote the first program for generating ran- 
dom domino tilings by the shuffling algorithm. 
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